StarT - Jr : A Parallel System from
نویسندگان
چکیده
StarT-jr is an experimental parallel system composed of a network of personal computers (PCs). The system leverages the momentum of the microprocessor and PC industries to achieve excellent single node performance at a low cost. For parallel processing, StarT-jr uses the Flexible User-level Network Interface (FUNi) to provide low-overhead, user-level inter-processor communication over two IEEE 1394 High Performance Serial Busses. This eecient message-passing mechanism enables StarT-jr to exploit ne-grained parallelism for good parallel performance. FUNi is based on an embedded processing system on a PCI card. Custom network hardware assembled from a commercial IEEE 1394 chip set provides FUNi with access to the IEEE 1394 network. In message passing, FUNi's embedded processor serves as a network coprocessor and manages an user-accessible message-passing interface in the host memory. User-level applications directly manipulate the interface location in host memory using cached reads and writes. Costly physical I/O accesses to device registers on the PCI bus are avoided. Currently, FUNi can eeciently support both ne-grain message passing and direct memory-to-memory transfers of large data blocks. FUNi can also support globally coherent shared memory by capturing and responding to memory accesses within a designated global address range. FUNi maintains a globally coherent shared memory cache to minimize global memory access latency. The necessary coherence protocol processing and communication is performed by the FUNi coprocessor. We have demonstrated a two-node prototype of StarT-jr and are awaiting fabrication of additional interface cards in order to assemble an eight-node system. StarT-jr currently supports an active message-based lightweight communication library for the C programming language. Preliminary measurements of the communication library demonstrated overheads of 14 sec for sending or receiving small (40 bytes) messages, and an user-to-user latency of 85 sec. Direct memory-to-memory transfers can sustain 3.4 MByte/sec on an unloaded network. With regard to the shared memory operation, reading a shared-memory location cached in FUNi takes approximately 2 sec.
منابع مشابه
START JR A Parallel System from Commodity Technology
StarT jr is an experimental parallel system composed of a network of personal computers PCs The system leverages the momentum of the microprocessor and PC industries to achieve excellent single node performance at a low cost For parallel processing StarT jr uses the Flexible User level Network Inter face FUNi to provide low overhead user level interprocessor communication over two IEEE High Per...
متن کاملاستخراج پیکره موازی از اسناد قابلمقایسه برای بهبود کیفیت ترجمه در سیستمهای ترجمه ماشینی
Data used for training statistical machine translation method are usually prepared from three resources: parallel, non-parallel and comparable text corpora. Parallel corpora are an ideal resource for translation but due to lack of these kinds of texts, non-parallel and comparable corpora are used either for parallel text extraction. Most of existing methods for exploiting comparable corpora loo...
متن کاملA fixed and flexible maintenance operations planning optimization in a parallel batch machines manufacturing system
Scheduling has become an attractive area for artificial intelligence researchers. On other hand, in today's real-world manufacturing systems, the importance of an efficient maintenance schedule program cannot be ignored because it plays an important role in the success of manufacturing facilities. A maintenance program may be considered as the heath care of manufacturing machines and equipments...
متن کاملComparing Parallel Simulated Annealing, Parallel Vibrating Damp Optimization and Genetic Algorithm for Joint Redundancy-Availability Problems in a Series-Parallel System with Multi-State Components
In this paper, we study different methods of solving joint redundancy-availability optimization for series-parallel systems with multi-state components. We analyzed various effective factors on system availability in order to determine the optimum number and version of components in each sub-system and consider the effects of improving failure rates of each component in each sub-system and impr...
متن کاملتقسیم بار بین اینورترهای موازی میکروگرید با استفاده از روش کنترل تکسیکلی
This paper investigate the properties of the circulating current and load sharing in parallel microgrid inverters. The existing methods cannot minimize efficient circulating current for the case of nonlinear load. The one-cycle control (OCC) technique and virtual complex impedance loop are included in the proposed controller in order to proper load sharing and reduce the circulating current due...
متن کامل